# data_files

While the raw CTS data are not included in this replication file, I have included a 1\% random sample.  Note, in the paper, we use a 20\% random sample as well as a 50\% random sample for the regional specifications.  In addition, the sample differs from the raw data by the creation of the variable, "id."  As specified in the do file 02_clean_e_labelling.do, id is a generated variable from unique loannum and zip codes.  Therefore, we drop loannum from the replication sample data in order to preserve privacy protection.   
